COMPETENT BACTERIA 
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60/409,089 filed September 6, 2002. 
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OR DEVELOPMENT 

[0002] This invention was made with United States government support awarded by the 

following agency: 

NIH GM35682 
The United States has certain rights in this invention. 

BACKGROUND OF THE INVENTION 
[0003] Bacteria have been used to produce a wide range of commercial products. For 

example, many Streptomyces strains and Bacillus strains have been used to produce antibiotics; 
Pseudomonas denitrificans and many Propionibacterium strains have been used to produce 
vitamin B 12; some other bacteria have been used to produce vitamin Riboflavin; Brevibacterium 
flavum and Corynebacterium glutamicum have been used to produce lysine and glutamic acid, 
respectively, as food additives; other bacteria have been used to produce other amino acids used as 
food additives; Alcaligenes eutrophas has been used to produce biodegradable microbial plastics; 
and many Acetobacter and Gluconobacter strains have been used to produce vinegar. More 
recently, it has become common for bacteria, such as Escherichia coli {E. coli), to be genetically 
engineered and used as host cells for the production of biological reagents, such as proteins and 
nucleic acids, in laboratory as well as industrial settings. The pharmaceutical industry supports 
several examples of successful products which are human proteins which are manufactured in E. 
coli cultures cultivated in a fermenter. 

[0004] It is not an uncommon occurrence for normal bacterial proteins to adversely affect 

the production or the purification of a desired protein product from an engineered bacteria. For 
example, when E. coli bacteria are used as host cells to generate in large quantity of a desired 
product encoded by a gene that is introduced into the host cells by a plasmid, certain normal E. 
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coli gene products can interfere with the introduction and maintenance of plasmid DNA. More 
significantly, because of the economies of bacterial culture in making proteins in bacteria, often 
the cost of purification of a recombinant protein can be more than the cost of production, and 
some of the natural proteins produced by the bacterial host are sensitive purification problems. 
Many bacterial strains produce toxins that must be purified away from the target protein being 
produced and some strains can produce, by coincidence, native proteins that are close in size to the 
target protein, thereby making size separation not available for the purification process. 
[0005] Also, however, the genome of a bacteria used in a fermenter to produce a 

recombinant protein includes many unnecessary genes. A bacteria living in a natural environment 
has many condition responsive genes to provide mechanisms for surviving difficult environmental 
conditions of temperature, stress or lack of food source. Bacteria living in a fermentation tank do 
not have these problems and hence do not require these condition responsive, genes. The bacterial 
host spends metabolic energy each multiplication cycle replicating these genes. Thus the 
unnecessary genes and the unneeded proteins, produced by a bacterial host used for production of 
recombinant protein, simply represent lack of efficiencies in the system that could be improved 
upon. 

[0006] It is not terribly difficult to make deletions in the genome of a microorganism. One 

can perform random deletion studies in organisms by simply deleting genomic regions to study 

what traits of the organism are lost by the deleted genes. It is more difficult, however, to make 

targeted deletions of specific regions of genomic DNA and more difficult still if one of the 

objectives of the method is to leave no inserted DNA, here termed a "scar," behind in the 

organism after the deletion. If regions of inserted DNA, i.e. scars, are left behind after a genomic 

deletion procedure, those regions can be the locations for unwanted recombination events that 

could excise from the genome regions that are desirable or engender genome rearrangements. 

Since in building a series of multiple deletions, scars left behind in previous steps could become 

artifactual targets for succeeding steps of deletion. This is especially so when the method is used 

repeatedly to generate a series of deletions from the genome. In other words, the organism 

becomes by the deletion process genetically unstable if inserted DNA is left behind. 

[0007] Another attribute of interest in bacterial strains of laboratory and industrial use is 

what is referred to as transformation competence. It is desirable to introduce exogenous DNA into 

bacteria in culture so that the bacteria will then stably maintain and reproduce the introduced 

DNA. Bacterial strains vary in their ability to take up and maintain foreign DNA. This 

characteristic is their transformation competence. 
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[0008] Certain treatments and culture conditions can affect transformation competence. In 

general, E. coli cells grown in normal growth medium, unlike certain bacilli, do not take up 
exogenous DNA. It was discovered by Mandel and Higa (J Mo I Biol 53: 159-162 (1970)) that 
treatment of E. coli with calcium chloride allowed E. coli to take up DNA from bacteriophage 
lambda. Cohen et al (Proc Nat Acad Sci 69 : 2 1 1 0 ( 1 972)) observed that E. coli could be 
transformed by this method with plasmid DNA. The state of E. coli induced by this treatment, in 
which they become able to take up DNA became known as "transformation competence" and the 
cells as "transformation competent cells" or more simply, as "competent cells". Under the Mandel 
and Higa conditions, the yield of transformants is typically in the region of 10 5 to 10 6 
transformants per microgram of plasmid DNA. 

[0009] Since the early observations of Mandel and Higa, many variations of the procedure 

have been tried in attempts to increase the efficiency of the procedure, in order to maximize the 
number of transformants per unit input of DNA, and to improve the reproducibility of the 
procedure notably by Hanahan and coworkers (JMol Biol 166: 557-580). The method of 
Hanahan is used widely to induce competence. Typically, the yield of transformants using the 
described method is 10 5 to 10 8 transformants per microgram of plasmid DNA. 
[0010] Generally, researchers have found that the transformation efficiency of competent 

cells is very variable and it has proven difficult to achieve reproducible results (discussed by 
Hengen, P.N., Trends in Biochemical Sciences 19:426-427 (1994) and Trends in Biochemical 
Sciences 21:75-76 (1996)). It has been found that growth at lower than normal temperature (18°C 
to 32°C) produces higher competence (US 4,981,797 and Inoue H et al, Gene 96:23-28 (1990). 
There is a need for procedures that reproducibly produce competent cells with high transformation 
efficiency. 

[0011] Transformation competence is a variable from strain to strain of bacterial hosts. 

Hanahan found that strain MM294 transformed readily, and developed a derivative strain, DH1, 
from which many of the strains in use today have been derived. Introduction of the deoR mutation 
(in strain DH5) appeared to increase the transformation efficiency (Hanahan Methods in 
Enzymology 204:63-1 13 (1991) and US 4,851,348) though the molecular mechanism of the 
improvement is still unknown. DH5 has been used extensively for transformation. Commercial 
strains have also been derived from strain MM294, via strain JM109 and strain AG1 . Strain AG1 
contains one or more additional uncharacterized mutations that increases transformation efficiency 
(Bullock et al, Biotechniques 5:376 (1987)). The commonly used strain DHlOb, which also 
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contains the deoR mutation, was 30 times better than DH1 in transformation with 66kb plasmid 
DNA. Its pedigree is different from DH1 and DH5, though they share common ancestors. All of 
the commonly used strains have undergone numerous mutational and recombinational treatments ) 
and are very distant from wild-type E. coli (Bachman, B Derivation and Genotypes of Some 
Mutants and Derivatives ofE. coli K12, Vol 2, chapter 133, 2460-2488, in: Escherichia coli and 
Salmonella, Neidhardt, F, editor, 2 nd Edition, ASM Press, Washington (1996)). There is a need 
for better characterized strains for transformation work that are nearer to wild-type. 
[0012] One of the most effective methods of introducing DNA into E. coli consists of 

subjecting the cells to an electric field while they are suspended in a defined medium of low ionic 
strength containing the DNA it is desired to introduce. This process is known as 
"electroporation" The transformation efficiency, that is the number of cells that are stably 
transformed by a unit measure of DNA, may be higher than in competent cells produced by other 
methods. A high and reproducible efficiency of transformation is desirable because it facilitates or 
in some cases makes possible cloning of rare and valuable DNA molecules, particularly those 
available in only small amounts. In particular electroporation is the preferred process for 
preparing cells for use in cloning bacterial artificial chromosomes ("BACs") and other large DNA 
vectors commonly used in genomic DNA sequencing. 

[001 3 J In order to be competent for transformation using the electroporation method, 

known as "electroporation competent" or "electro-competent", the E. coli cells need to be grown 
under certain specific conditions and subjected to certain specific treatments. Methods for 
inducing electroporation competence are described in Dower et al (Nucleic Acid Research 
16:6127-6145 (1988)), Calvin and Hanahan (JBacteriol 170:2796-2801 (1988)) and US 
4,910,140 and US 5,186,800. These publications also discuss the many factors that affect the 
competence and transformation efficiency of E. coli. 

[0014] However, the methods described in the literature for preparing electroporation 

competent cells are not ideal. The efficiency of transformation is variable depending upon the 

strain of bacteria used, the conditions used to grow the bacteria and the conditions used to expose 

the bacteria to the electric field. In particular, only two growth media have been disclosed for the 

preparation of competent cells: Luria-Bertani broth, commonly known as 'LB broth' (Miller JH 

(1972) Experiments in Molecular Biology, Cold Spring Harbor Laboratory, New York) and SOB 

medium, described by Hanahan (JMol Biol 166: 557-580). The composition of LB (per liter of 

medium) is Bacto Tryptone 10 g, Bacto Yeast Extract 5g and sodium chloride lOg. The 

composition of SOB (per liter of medium) is Tryptone 20g, Yeast Extract 5g, sodium chloride 
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0.5g, magnesium sulphate 2.4g and potassium chloride 0.126g. Neither medium has any 
additional carbon source, such as glucose or other carbohydrate, or glycerol, or additional pH 
buffering component such as phosphate or non-metabolisable compounds such as MOPS or Tris. 
The amino acid compounds in the tryptone and yeast extract serve as both nitrogen and carbon 
sources. 

[0015] The stage of the growth cycle at which cells are harvested has been described as 

critical to success, and there is a narrow window in the cell concentration at harvest, above which, 
or below which, the transformation efficiency drops off rapidly (Calvin and Hanawalt (1988); 
Dower et al (1988), supra). In LB medium, the optimal stage of the growth cycle occurs at a cell 
concentration corresponding to an optical density of 0.6. This is a relatively low cell density, 
since E. coli can grow to cell densities many times higher than this under appropriate fermentation 
conditions, well known to those trained in the art of fermentation. As a result, the process of 
preparing electroporation competent cells is wasteful of expensive growth media, fermentor 
capacity and time, and operator time, compared with a process that could use cells grown to higher 
cell densities. It also requires more expensive downstream processing equipment and time, since 
the competent cells need to be recovered from larger volumes of dilute suspension. There is a 
need for better methods for preparing electroporation competent cells, in particular it would be 
economically beneficial to have a process in which cells can be grown to much higher cell 
densities without losing transformation efficiency. 

[0016] As further evidence of the difficulties scientists face in making competent cells, a 

market has developed for pre-prepared, frozen competent E. coli cells that exhibit a higher, more 
consistent and reproducible efficiency of transformation. In producing commercial competent 
cells, manufacturers have resorted to several conditions to improve their processes. US 4,981,797 
discloses an improved process operating at temperatures between 1 8°C and 32°C, but nevertheless, 
cells need to be harvested at low cell concentrations. US 6,040,184, US 6,338,965 and WO 
0022147 disclose the addition of sorbitol or other sugars to improve the transformation efficiency. 
WO 0109362 discloses a procedure to select variant bacteria that are more tolerant of the killing 
effect of the high field electrical pulse used in the electroporation procedure. 

BRIEF SUMMARY OF THE INVENTION 
[0017] The present invention provides a bacterium having a genome that is genetically 

engineered to be at least two percent (2%) to fifteen percent (15%) smaller than the genome of its 



-5- 



native parent strain. When used to produce a product, a bacterium with a smaller genome can 
have one or more of the following advantages. One, the production process can be more efficient 
either in terms of resource consumption or in terms of production speed, or both. Two, the 
product purification process can be simplified or purer products can be made. Three, a product 
that cannot be produced before due to native protein interference can be produced. Fourthly, as 
described here, the reduced genome bacteria has an improved competence for transformation. 
[0018] The present invention is also directed toward a method for culturing bacteria for 

transformation events. The method also results in improved transformation efficiencies and can 
be used with or without the reduced genome bacteria. 

[0019] It is an advantage of the present invention that the reduced genome bacteria and the 

method for culturing the bacteria can be used together to maximize transformation efficiency. 
[0020] Other objects, features and advantages of the invention will become apparent upon 

consideration of the following detailed description. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 
[0021] Fig. 1 shows positions of the genes and other DNA sequences on E. coli K-12 

bacterial genome that were candidates for deletion as black and lighter hatched boxes on the 
outermost ring. 

[0022] Fig. 2 illustrates a specific example of a linear DNA-based scarless genetic 

modification method of the present invention. 

[0023] Fig. 3 illustrates a specific example of another linear DNA-based method of the 

present invention. 

[0024] Fig. 4 shows a mutagenesis plasmid that can be used in the linear DNA-based 

method illustrated in Fig. 3. 

DETAILED DESCRIPTION OF THE INVENTION 
[0025] In this specification, we address the transformation competence of bacterial cells in 

general, and in particular cells of E. coli. First it was found that E. coli strains which had the size 
of their genomes reduced by a series of deletions of genetic material also proved to be highly 
competent for transformation events. Second, in the course of working with the transformation 
competent E. coli, a methodology was developed for culturing bacterial strains to prepare them for 
transformation. These two insights are preferable used together, but are also useful separately. 
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The reduced genome E, coli have improved transformation competence regardless of the methods 
used and the method can be used to render other bacterial strains more competent. However, the 
best transformation efficiencies will be recognized by using the reduced genome E. coli strains in 
the methodology described here. While the description of the method for reducing the genome of 
E. coli are lengthy, that aspect will be discussed here first, after which the discussion will return to 
transformation competence and the methodology. 

[0026] The workers here developed several methods for deleting genomic DNA from a 

genome without leaving any inserted DNA behind. If one is making several sequential deletions 
from the single DNA molecule which makes up a bacterial genome, it is important not to leave 
any inserted DNA sequences behind. Such inserted sequences, if they were left behind, would be 
candidate sites for undesired recombination events that would delete uncharacterized and perhaps 
important portions of the remaining genome from the bacteria or cause other unanticipated 
genome rearrangements with untoward effects. Since one of the objectives of the genome 
reduction effort is to increase the genetic stability of the bacteria, leaving any inserted DNA 
behind would be contrary to the objective, and should be avoided. Thus the methods used to 
delete DNA from the genome become important and sophisticated. 

[0027] In one aspect, the present invention relates to a bacterium having a genome that is 

genetically engineered to be smaller than the genome of its native parent strain. For exemplary 
purposes, the work described here has focused on the common laboratory and industrial bacterium 
Escherichia coli. The genome reduction work described here began with the laboratory E. coli 
strain K12, which had prior to the work described here, a genome of 4,639,221 nucleotides or base 
pairs. The bacterium of the present invention can have a genome that is as much as 14% to 16%, 
smaller than the genome of its native parental strain. We have so far reduced the genome of E. 
coli K12 by a little over fourteen percent (14.1%), without disabling the bacteria from its protein 
production utility. The term "native parental strain" means a bacteria strain found in natural or 
native environment as commonly understood by the scientific community and on whose genome a 
series of deletions can be made to generate a bacterial strain with a smaller genome. The 
percentage by which a genome has become smaller after a series of deletions is calculated by 
dividing "the total number of base pairs deleted after all of the deletions" by "the total number of 
base pairs in the genome before all of the deletions" and then multiplying by 100. The strain of E. 
coli with the 14% genome reduction has been named MDS39, so-named because thirty-nine 
deletions of material from the genome been performed from the parental strain MG1655. 
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[0028] Below, E. coli is used as an example to illustrate the genes and other DNA 

sequences that are candidates for deletion in order to generate a bacterium that can produce a 
desired product more efficiently. The general principles illustrated and the types of genes and 
other DNA sequences identified as candidates for deletion are applicable to other bacteria species 
or strains. It is understood that genes and other DNA sequences identified below as deletion 
candidates are only examples. Many other E, coli genes and other DNA sequences not identified 
may also be deleted without affecting cell survival and proliferation to an unacceptable level. 
[0029] It is assumed in the analysis and methodology described below that the DNA 

sequence of the target bacterial strain is available. The full genomic sequence of several strains of 
E. coli is, of course, now published (for example, Blattner et al, Science, 277:1453-74, 1997; 
Perna et al, nature, 409, 529-533, 2001; Hayashi et al, DNA Res., 8, 1 1-22, 2001), as is the 
sequence of several other commonly used laboratory bacteria. To start the deletion process, the 
genome of the bacteria is analyzed to look for those sequences that represent good candidates for 
deletion. Of course, these techniques can also be applied to partially sequenced genomes in the 
genomic areas for which sequence date is available or could be determined. 
[0030] In E. coli, and other bacteria as well, as well as in higher organisms, a type of DNA 

sequence that can be deleted includes those that in general will adversely affect the stability of the 
organism or of the gene products of that organism. Such elements that give rise to instability 
include transposable elements, insertion sequences, and other "selfish DNA" elements. For 
example, insertion sequence (IS) elements and their associated transposes are often found in 
bacterial genomes, and thus are targets for deletion. IS sequences are common in E. coli, and all 
of them may be deleted. For purposes of clarity in this document, we use the term IS element 
genetically to refer to DNA elements, whether intact or defective, that can move from one point to 
another in the genome. An example of the detrimental effects of IS elements in science and 
technology is the fact that they can hop from the genome of the host E. coli into a BAC plasmid 
during propagation for sequencing. Many instances are found in the human genome and other 
sequences in the GenBank database. This artifact could be prevented by deletion from the host 
cells of all IS elements. For a specific application, other specific genes associated with instability 
may also be deleted. 

[0031] Shown in Fig. 1 is illustration of the E. coli genome, which natively, in the K12 

strain, comprises 4,639,221 base pairs. Fig. 1, shows, on the inner ring, the scale of the base pair 
positions of the E. coli K12 genome (strain MG1655), scaled without deletions. The next ring 
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progressively outward shows regions of the K12 genome that are missing or highly altered in a 
related strain 0157:H7, and which are thus potentially deletable from the K12 genome. The next 
ring outward shows the positions of the IS elements, both complete and partial, in the native 
genome. The next ring moving outward shows the positions of the RHS elements A to E and 
flagellar and restriction regions specially targeted for deletion here. The outermost ring shows the 
location of the deletions actually made to the genome, as also listed in Tables 1 and 2 below. 
These deletions make up about 14 percent of the base pairs in the original K12 MG 1655 genome. 
[0032] Another family of E. coli genes that can be deleted is the flagella gene family. 

Flagella are responsible for motility in bacteria. In natural environments, bacteria swim to search 
for nutrients. In cultured environments, bacteria motility is not important for cell survival and 
growth and the swimming action is metabolically very expensive, consuming over 1% of the 
cellular energy to no benefit. Thus, the flagella genes may be deleted in generating a bacterium 
with a smaller genome. Positions of flagella genes on an E. coli genome map are shown in Fig. 1 
and Table 1. 

[0033] Another family of E. coli genes that can be deleted is the restriction modification 

system and other nucleases whose products destroy foreign DNA. These genes are not important 
for bacterial survival and growth in culture environments. These genes can also interfere with 
genetic engineering by destroying plasmids introduced into a bacterium. Thus, these genes can be 
deleted in generating a bacterium with a smaller genome. Positions of restriction modification 
system genes on an E. coli genome map are shown in Fig. 1 and Table 1 . 

[0034] One type of E. coli DNA element, already mentioned, that can be deleted is the IS 

elements. IS elements are not important for bacteria survival and growth in a cultured 
environment and are known to interfere with genome stability. Thus, the IS elements can be 
deleted in generating a bacterium with a smaller genome. Positions of the IS elements on an E. 
coli genome map are shown in Fig. 1 and Table 1 . 

[0035] Another type of E. coli DNA element that can be deleted is the Rhs elements. All 

Rhs elements share a 3.7 Kb Rhs core, which is a large homologous repeated region (there are 5 
copies in E. coli K-12) that provides a means for genome rearrangement via homologous 
recombination. The Rhs elements are accessory elements which largely evolved in some other 
background and spread to E. coli by horizontal exchange after divergence of E. coli as a species. 
Positions of the Rhs elements on an E. coli genome map are shown in Fig. 1 and Table 1 . 
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[0036] One type of region in the E. coli genome that can be deleted is the non-transcribed 

regions because they are less likely to be important for cell survival and proliferation. Another 
type of regions in the E. coli genome that can be deleted is the hsd regions. The hsd regions 
encode for the major restriction modification gene family which has been discussed above. 
Positions of the non-transcribed regions and the hsd regions on an E. coli genome map are shown 
in Fig. 1 and Table 1 . 

[0037] One general method to identify additional genes and DNA sequences as deletion 

candidates is to compare the genome of one bacterial strain to another. Any DNA sequences that 
are not present in both strains are less likely to be functionally essential and thus can be used for 
identifying candidates for deletion. In the examples described below, the complete genomic 
sequences of two E. coli strains, 0157:H7 EDL933 and K-12 MG1655, were compared. DNA 
sequences that were not found in both strains were used to identify targets for deletion. Twelve 
such identified targets from E. coli strain MG1655 were deleted, resulting in a bacteria strain with 
a genome that is about 8% smaller. The bacteria with the reduced genome are alive and grow at 
substantially the same rate as the native parent MG1655 strain. 

[0038] One can test the consequence of deleting one or several genes or other DNA 

sequences from the genome. For example, after one or several genes or other DNA sequences of 
the genome have been deleted, one can measure the survival and proliferation rate of the resultant 
bacteria. Although most of the above-identified genes or other DNA sequences may be deleted 
without detrimental effect for purpose of producing a desired product, it is possible that the 
deletion of a specific gene or other DNA sequence may have an unacceptable consequence such as 
cell death or unacceptable level of reduction in proliferation rate. This possibility exists because 
of redundancies in gene functions and interactions between biological pathways. Some deletions 
that are viable in a strain without additional deletions will be deleterious only in combination with 
other deletions. The possibility exists also because of certain methods used to identify deletion 
candidates. For example, one method used to identify deletion candidates is to compare two E. 
coli strains and select genes or other DNA sequences that are not present in both strains. While 
the majority of these genes and other DNA sequences are not likely to be functionally essential, 
some of them may be important for a unique strain. Another method used to identify deletion 
candidates is to identify non-transcribed regions and the possibility exists that certain non- 
transcribed regions may be important for genome stability. 
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[0039] The consequence of deleting one or several genes or other DNA sequences to be 

tested depends on the purpose of an application. For example, when high production efficiency is 
the main concern, which is true for many applications, the effect of deletions on proliferation rate 
and medium consumption rate can be the consequence tested. In this case, the consequence tested 
can also be more specific as the production speed and quantity of a particular product. When 
eliminating native protein contamination is the main concern, fewer native proteins and lower 
native protein levels, or the absence of a specific native protein, can be the consequence tested. 
[0040] Testing the consequence of deleting a gene or other DNA sequence is important 

when little is known about the gene or the DNA sequence. Though laborious, this is another 
viable method to identify deletion candidates in making a bacterium with a reduced genome. This 
method is particularly useful when candidates identified by other methods have been deleted and 
additional candidates are being sought. 

[0041] When the consequence of deleting a gene or other DNA sequence has an effect on 

the viability of the bacteria under a set of conditions, one alternative to not deleting the specific 
gene or other DNA sequence is to determine if there are measures that can mitigate the detrimental 
effects. For example, if deleting lipopolysaccharide (LPS) genes results in poor survival due to 
more porous cellular membranes caused by the absence from the cellular membranes of the 
transmembrane domain of the LPS proteins, culture conditions can be changed to accommodate 
the more porous cellular membranes so that the bacteria lacking the LPS genes can survive just as 
well as the bacteria carrying the LPS genes. 

[0042] Methods for deleting DNA sequences from bacterial genomes that are known to one 

of ordinary skill in the art can be used to generate a bacterium with a reduced genome. Examples 
of these methods include but are not limited to those described in Posfai, G. et al., J. Bacteriol. 
179: 4426-4428 (1997), Muyrers, J.P.P. et al., Nucl Acids Res. 27:1555-1557 (1999), Datsenko, 
K.A. et al., Proc. Natl. Acad, ScL 97:6640-6649 (2000) and Posfai, G. et al., Nucl. Acids Res. 27: 
4409-4415 (1999), all of which are hereby incorporated by reference in their entirety. Basically, 
the deletion methods can be classified to those that are based on linear DNAs and those that are 
based on suicide plasmids. The methods disclosed in Muyrers, J.P.P. et al., Nucl. Acids Res. 
27:1555-1557 (1999) and Datsenko, K.A. et al., Proc. Natl. Acad. ScL 97:6640-6649 (2000) are 
linear DNA-based methods and the methods disclosed in Posfai, G. et al., J. Bacteriol. 179: 4426- 
4428 (1997) and Posfai, G. et al., Nucl. Acids Res. 27: 4409-4415 (1999) are suicide plasmid- 
based methods. 
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[0043] Some known methods for deleting DNA sequences from bacterial genomes 

introduce extraneous DNA sequences into the genome during the deletion process and thus create 
a potential problem of undesired homologous recombination if any of the methods is used more 
than once in a bacterium. To avoid this problem, scarless deletion methods are preferred. By 
scarless deletion, we mean a DNA sequence is precisely deleted from the genome without 
generating any other mutations at the deletion sites and without leaving any inserted DNA in the 
genome of the organism. However, due to mistakes, such as those made in PCR amplification and 
DNA repairing processes, one or two nucleotide changes may be introduced occasionally in 
scarless deletions. Described below are some novel scarless deletion methods, either linear DNA- 
based or suicide plasmid-based. These novel methods have been applied to E. coli strains in the 
examples described below. It is understood that the specific vectors and conditions used for E. 
coli strains in the examples can be adapted by one of ordinary skill in the art for use in other 
bacteria. Similar methods and plasmids can be used to similar effect in higher organisms. In 
some instances it may be more appropriate to modify an existing production strain rather than 
transfer production to the minimized genome E. coli strain. 
Novel linear DNA-based scarless deletion method I 

[0044] The novel DNA-based scarless deletion method of the present invention can be best 

understood when the following description is read in view of Fig. 2. Generally speaking, the 
method involves replacing a segment of the genome, marked for deletion, with an artificial DNA 
sequence. The artificial sequence contains one or more recognition sites for a sequence-specific 
nuclease such as I-Scel, which cuts at a sequence that does not occur natively anywhere in the E. 
coli K12 genome. Precise insertion of the linear DNA molecule into the genome is achieved by 
homologous recombination aided by a system that can increase the frequency of homologous 
recombination. When the sequence-specific nuclease is introduced into the bacteria, it cleaves the 
genomic DNA at the unique recognition site or sites, and only those bacteria in which a 
homologous recombination event has occurred will survive. 

[0045] Referring specifically to Fig. 2, the plasmid pSG76-CS is used as a template to 

synthesize the artificial DNA insert. The artificial insertion sequence extends between the 
sequences designated A, B and C in Fig. 2. The C R indicates a gene for antibiotic resistance. The 
insert DNA is PCR amplified from the plasmid and electroporated into the E. coli host. The insert 
was constructed so that the sequences A and B match sequences in the genome of the host which 
straddle the proposed deletion. Sequence C of the insert matches a sequence in the host genome 
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just inside sequence B of the host genome. Then the bacteria are selected for antibiotic resistance, 
a selection which will be survived only by those bacteria in which a homologous recombination 
event occurred in which the artificial DNA inserted into the bacterial genome. This recombination 
event occurs between the pairs of sequences A and C The inserted DNA sequence also includes a 
sequence B, now positioned at one end of the insert, which is designed to be homologous to a 
sequence in the genome just outside the other end of the insert, as indicated in Fig. 2. Then, after 
growth of the bacteria, the bacteria is transformed with a plasmid, pSTKST, which expresses the I- 
Scel sequence-specific nuclease. The I-Scel enzyme cuts the genome of the bacteria, and only 
those individuals in which a recombination event occurs will survive. 10-100% of the survivors 
are B to B recombination survivors, which can be identified by a screening step. The B to B 
recombination event deletes the entire inserted DNA from the genome, leaving nothing behind but 
the native sequence surrounding the deletion. 

[0046] To repeat, the first step of the method involves providing a linear DNA molecule in 

a bacterium. The linear DNA molecule contains an artificial linear DNA sequence that has the 
following features: one end of the linear DNA sequence is a sequence identical to a genome 
sequence on the left flank of the genome region to be deleted, followed by a sequence identical to 
a genome sequence on the right flank of the genome region to be deleted; the other end of the 
linear DNA molecule is a sequence identical to a genome sequence within the genome region to 
be deleted; between the two ends of the linear DNA, there is a recognition site that is not present 
in the genome of the bacterial strain and an antibiotic selection gene. The artificial DNA sequence 
can be made using polymerase chain reaction (PCR) or directed DNA synthesis. A PCR template 
for this purpose contains the unique recognition site and the genomic DNA sequences on both 
ends of the artificial linear DNA sequence are part of the primers used in the PCR reaction. The 
PCR template can be provided by a plasmid. An example of a plasmid that can be used as a 
template is pSG76-C (GenBank Accession No. Y09893), which is described in Posfai, G. et al., J. 
Bacteriol 179: 4426-4428 (1997). pSG76-CS (GenBank Accession No. AF402780), which is 
derived from pSG76-C, may also be used. pSG76-CS contains the chloramphenicol resistance 
(Cm R ) gene and two I-Scel sites, and was obtained by the PCR-mediated insertion of a second I- 
Scel recognition site into pSG76-C, downstream of the NotI site. The two I-Scel sites are in 
opposite direction. 

[0047] An artificial or constructed DNA sequence can be provided to a bacterium by 

directly introducing the linear DNA molecule into the bacterium using any method known to one 

of ordinary skill in the art such as electroporation. In this case, a selection marker such as an 
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antibiotic resistance gene is engineered into the artificial DNA sequence for purpose of selecting 
colonies containing the inserted DNA sequence later. Alternatively, a linear DNA molecule can 
be provided in a bacterium by transforming the bacterium with a vector carrying the artificial 
linear DNA sequence and generating a linear DNA molecule inside the bacterium through 
restriction enzyme cleavage. The restriction enzyme used should only cut on the vector but not 
the bacterial genome. In this case, the artificial linear DNA sequence does not have to cany a 
selection marker because of the higher transformation efficiency of a vector so that a bacterium 
with the inserted linear DNA can be screened by PCR later directly. 

[0048] The second step of the scarless deletion method involves replacement of a genomic 

region by insertion of the artificial DNA molecule. The bacterial cells are engineered to contain a 
system that increases the frequency of homologous recombination. An example of such a system 
is the Red recombinase system. The system can be introduced into bacterial cells by a vector. The 
system helps the linear DNA molecule to replace a genomic region which contains the deletion 
target. As described in the examples below, a vector carrying a homologous recombination 
system that can be used in E. coli is pBADccpy, which is described in Muyrers, J.P.P. et al., Nucl. 
Acids Res. 27:1555-1557 (1999). Another plasmid pKD46 described in Datsenko, K.A, et aL, 
Proc. Natl Acad. Sci. 97:6640-6649 (2000) may also be used. Other plasmids that can be used 
include pGPXX and pJGXX. pGPXX is derived from pBADapy by replacing the origin of 
replication in pBADafJy with pSClOl origin of replication. pJGXX is a pSClOl plasmid that 
encodes the Red functions from phage 933W under tet promoter control 

[0049] The third step of the scarless deletion method involves removal of the inserted DNA 

sequence. An expression vector for a sequence-specific nuclease such as I-Scel that recognizes 
the unique recognition site on the inserted DNA sequence is introduced into the bacteria. The 
sequence-specific nuclease is then expressed and the bacterial genome is cleaved. After the 
cleavage, only those cells in which homologous recombination occurs resulting in a deletion of the 
inserted linear DNA molecule can survive. Thus, bacteria with a target DNA sequence deleted 
from the genome are obtained. Examples of sequence-specific nuclease expression vectors that 
can be used in E. coli include pKSUCl, pKSUCS, pSTKST, pSTAST, pKTSHa, pKTSHc, 
pBADScel and pBADSce2. The sequence- specific nuclease carried by these vectors is I-Scel. 
pKSUCl, pKSUCS, pSTKST and pSTAST are described below in the examples. 
[0050] The method described above can be used repeatedly in a bacterium to generate a 

series of deletions. When the expression vector for the homologous recombination system and the 
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expression vector for the unique sequence-specific nuclease are not compatible with each other, 
such as the case for pBADapy and pKSUCl, transformation of the two vectors have to be 
performed for each deletion cycle. Transformation of the two vectors can be avoided in additional 
deletion cycles when two compatible plasmids, such as pB ADapy and pSTKST, or pKD46 and 
pKSUC5, are used. An example of using two of these vectors that are compatible with each other 
is described in the examples below. 

[0051] The above scarless deletion method can be modified to make a series of deletions on 

a bacterial genome more efficient. The first step of the modified method involves motiving 
insertions of a linear DNA molecule individually in bacterial cells, preferably wild-type bacteria 
cells, in a parallel fashion, resulting in a set of strains, each carrying a single insertion. This step 
can be carried out as described above. The second step of the modified method involves 
sequentially transferring individual insertions into the target cell whose genome is to be reduced. 
PI transduction is an example of the methods that can be used for transferring insertions. The 
third step of the modified method involves recombinational removal of the inserted sequence, 
which can be carried out as described above. 

Other DNA-based scarless deletion methods 
There are other alternative methods for deletion of genomic DNA from a bacterial strains. In the 
interests of brevity, they will not be discussed further here, except to acknowledge that they exist. 

Reduced genome E. coli competence 
[00521 As already mentioned above, the strain MDS39 carries a genome which is reduced 

in excess of 14% from that of its parental strain. This strain exhibited a high efficiency of 
transformation, equal to one of the best strains that is in common commercial use as an 
electroporation competent strain, even when grown at 37°C and without addition of sorbitol or 
other sugars. This high efficiency of transformation extended to BAC DNA of a large size that 
generally transforms with a much lower efficiency than small DNA. This makes the strain 
MDS39 particularly valuable as a vehicle for cloning DNA for the wide range of applications 
noted above, since not only is it potentially safer and cleaner because of the deletion of many 
unnecessary genes that might cause contamination, but it also exhibits an unusually high 
efficiency of transformation by electroporation. The beneficial characteristics of MDS39 were 
also present in an earlier strain in the genome deletion series, MDS31. 

[0053] Further experimentation have also shown that strain MDS41-R13 (41 deletions), the 

first multi-deletion strain to be completely clear of all Insertion Sequences (ISs) also is capable of 
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high efficiency transformation with both small plasmid (pBR322 and large DNA (lOOkb BAC). 
This shows that this beneficial characteristic is heritable in strains derived from MDS39, and 
indeed from MDS31 and even earlier strains in the deletion sequence, by additional genetic 
modification. It is not known if the trait of competence is created by the deletion of one or more 
of the genetic elements from the bacterial genome, or if it is a cumulative by-product of the many 
transformation events performed on the strains as the strains were made. To ensure that the use of 
this competent strain is enabled, samples of the bacterial strain MDS41-R13 are being deposited 

with the American Type Culture Collection, Manassas, Virginia, as accession number . 

[0054] While both E. coli MG1655 and MDS41-R13 grow well in LB medium, the 

stationary phase cell density is limited and the medium becomes quite alkaline, probably 
sufficiently alkaline to curtail growth. Furthermore, while strain MDS41-R13 grows as fast as 
MG1655 in the exponential growth phase, it exhibits a lag phase which is not exhibited by 
MG1655. Competent cells prepared from E. coli MDS41-R13 grown in LB show higher 
transformation efficiency than MG1655 grown under the same conditions. The transformation 
efficiency of MDS41-R13 is surprisingly high, given that the cells can be grown out at 37°C and 
no sorbitol or other sugars have been added. Data presented below shows that MDS41-R13 and 
MDS40 grow at about the same rate (as measured by the doubling times) in exponential phase as 
MG1655. In baffled flasks, the growth rates are higher (doubling time lower) than in the 
unbaffled flasks. Both MDS41-R13 and MDS40 exhibit the lag phase seen in the examples 
below, but not seen in MG1655. The longer the seed cultures are incubated, the longer is the lag 
phase. 

Methodology . 

[0055] In trying to improve the sub-optimal growth particularly of the MDS strains, we first 

tried adding glucose to the medium to provide a better carbon source, however, the medium 
became very acidic, and growth of MG1655 halted at a lower stationary phase cell density than in 
LB without glucose (OD650=1 .0 compared with 2.0 for LB). Adding phosphate to act as a pH 
buffer improved the growth, presumably by absorbing the hydrogen ions produced during growth 
on glucose. So-called Terrific Broth, which was originally developed for plasmid DNA 
production (Tartoff and Hobbs) has the composition, per liter of medium, Tryptone 12g, Yeast 
Extract 24g, potassium dihydrogen phosphate 9.4g, di-potassium hydrogen phosphate 2g, and 
glycerol as the carbon source. Glycerol is known to produce less acid in fermentations when it is 
used as the main carbon source. We discovered that E. coli strains MDS41-R13 and MG1655 
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grew extremely well in Terrific Broth (TB). While the doubling time, 36 minutes was very 
slightly slower in TB compared with LB, in Terrific Broth, MDS41-R13 reached 5-6 times the cell 
density of an LB culture, and MG1655 reached approximately 3 times the cell density. 
Furthermore, MDS41-R13 did not exhibit any lag phase in Terrific Broth. 
[0056] E. coli strain MDS41-R13 was found to transform by electroporation with high 

efficiency when grown in TB. We first tried harvesting the cells at OD650=0.5, as recommended 
for LB, and found that the transformation efficiency when used in electroporation was surprisingly 
high compared with cells grown in the conventionally used LB medium. We then tried harvesting 
at OD650=1 .0, and surprisingly, the transformation efficiency was still high. Even more 
surprisingly, when the cells were harvested at OD650=2 to 2.2, the cells were still highly efficient, 
as seen in the examples below. The literature teaches that electro-competence falls off rapidly if 
cells are allowed to grow to an OD greater than 0.5 to 0.6. The efficiency of some of the cells 
prepared from cultures grown in TB to an OD650 of around 2 has exceeded 10 10 transformants per 
microgram of pUC19. This is an unexpectedly high level of transformation, not only because of 
the high cell density at harvest, but also in view of the fact that the cultures were grown at 37°C, 
which the literature teaches leads to lower transformation efficiency than growth at 18-26°C. 
[0057] It is believed that the superiority of the Terrific Broth treatment is due to the use of 

glycerol as the main carbon source. By using glycerol as the carbon source, the cultures produce 
less acid and therefore grow longer under limited oxygen concentrations, as compared with what 
would be the case with other carbon sources. Good nutrient concentrations and good pH buffering 
may also play a part. It is also an advantage of this medium that the cultures may be maintained at 
higher temperatures, over 28°C, and as high as about 37°C. This temperature permits continued 
grow of the culture. While the results here were obtained with baffled flasks, the same results can 
be expected in commercial fermentors and other culture vessels. 

[0058] The above disclosure generally describes the present invention. The invention will 

be more fully understood upon consideration of the following examples which are provided herein 
for purposes of illustration only and are not intended to limit the scope of the invention. 
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EXAMPLES 

Plasmids 

[0059] The plasmid used for PCR construction of the artificial inserted DNA sequence was 

designated pSG76-CS (GenBank Accession No. AF402780), which was derived from pSG76-C 
(Posfai, G. et al, J. Bacteriol 179: 4426-4428 (1997)) by inserting a second I-Scel site. The 
second I-Scel site was obtained by the PCR-mediated insertion of a second I-Scel recognition site 
into pSG76-C, downstream of the NotI site. The two I-Scel sites are in opposite direction. 
[0060] The pBADaPy plasmid was used for enhancing recombination of linear DNA- 

fragments into the genome. This plasmid was described in Muyrers, J.P.P. et al, Nucl Acids Res. 
27:1555-1557(1999). 

[0061] The PKSUC1 plasmid (GenBank Accession No. AF402779), for expressing I-Scel, 

was derived from pSG76-K (Posfai, G. et al, J. Bacteriol 179: 4426-4428 (1997)) and 
pUC19RP12 (Posfai, G. et al., Nucl Acids Res. 27: 4409-4415 (1999)). The Xbal-NotI fragment 
(carries the Kan gene; the NotI end was blunted by Klenow polymerase) of pSG76-K was ligated 
to the Xbal-Dral fragment (carries the I-Scel gene and the pUC ori) of pUC19RP12. 
[0062] The pKSUC5 plasmid for tetracycline-regulated expression of I-Scel was derived 

from pFT-K (Posfai, G. et al, J. Bacteriol 179: 4426-4428 (1997)) and pKSUCl. The large 
Xbal-Ncol fragment of pKSUCl was ligated to the Xbal-Ncol fragment of pFT-K carrying the tet 
repressor. 

[0063] The PKD46 plasmid for enhancing recombination of linear DNA-fragments into the 

genome was described in Datsenko, K.A. et al, Proc. Natl Acad. Sci. 97:6640-6649 (2000). 
[0064] The plasmid pSTKST (GenBank Accession No. AF406953) is a low copy number 

Kan R plasmid for chlortetracycline-regulated expression of I-Scel, derived from pFT-K (Posfai, G. 
et al, J. Bacteriol 179: 4426-4428 (1997)) and pUC19RP12 (Posfai, G. et al, Nucl Acids Res. 
27: 4409-4415 (1999)). The Xbal-PstI fragment from pUC19RP12, carrying the I-Scel gene, was 
ligated to the large Xbal-PstI fragment of pFT-K. This plasmid expresses I-Scel when induced by 
chlortetracycline. Replication of the plasmid is temperature-sensitive (Posfai, G. et al, J. 
Bacteriol 179:4426-4428(1997)). 

[0065] The plasmid pSTAST, a low copy number Ap R plasmid for chlortetracycline- 

regulated expression of I-Scel, was derived from pFT-A (Posfai, G. et al, J. Bacteriol 179: 4426- 
4428 (1997)) and pUC19RP12 (Posfai, G. et al, Nucl Acids Res. 27: 4409-4415 (1999)). The 
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Xbal-PstI fragment from pUC19RP12, carrying the I-Scel gene, was ligated to the large Xbal-PstI 
fragment of pFT-A. This plasmid expresses I-Scel when induced by chlortetracycline. Replication 
of the plasmid is temperature-sensitive (Posfai, G. et al., J. Bacteriol 179: 4426-4428 (1997)). 

Procedure 1 

[0066] This describes the process used to repeatedly make deletions from the genome of E. 

coli K12. This procedure is a scarless deletion method. The procedure begins with the 
construction of a linear target fragment by PCR. This was done by mixing 20 pmol of primer A 
with 20 pmol primer B, and performing PCR in a total volume of 50 pi. The cycle parameters 
used were 15x(94°C 40sec/57°C or lower (depending on the overlap of A and B) 40sec/72°C 
15sec). The 1 nl of the PCR mix above were taken, added to 20 pmol of primers A and C each, 
add 50 ng of pSG76-CS and perform PCR in a volume of 2x50 \i\ (use 50-(al tubes, and two tubes 
are combined to have more DNA). The cycle parameters used were 28x(94°C 40sec/57°C 
40sec/72°C 80sec). To purify the PCR mix from the above step, Promega Wizard PCR 
purification kit was used. The resulting DNA fragment was suspended in 20 fil water. 
[0067] Next was the replacement of a genomic region by insertion of the artificial DNA- 

fragment. This was done by taking the target cell carrying pBADaPy and preparing 
electrocompetent cells as described (Posfai, G. et al., Nucl Acids Res. 27: 4409-4415 (1999)), 
except that 0.1% arabinose was added to the culture 0.25 - 1 hour before harvesting the cells. 4 p.1 
of DNA fragments (100-200 ng) were electroporated into 40 nl of electrocompetent cells. The 
cells were plated on Cam plates (25 jig cam/ml) and incubated at 37°C. The usual result was to 
obtain a total of 10 to several hundred colonies after overnight incubation. A few colonies were 
checked for correct site insertion of the fragment by PCR using primers D and E. 
[0068] Next was the deletion of the inserted sequences. This was done by preparing 

competent cells derived from a selected colony from above by the CaCl2 method (Sambrook, J. et 
al., Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY (1989)). The plasmid pKSUCl (-100 ng) was transformed into the cells by standard 
procedures (Sambrook, J. et al., Molecular Cloning: A Laboratory Manual. Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY (1989)). The cells were plated on Kan plates and 
incubated at 37°C (pKSUCl and pBADaPy are incompatible, thus selection on Kan eliminates 
pBADaPy from the cells). The colonies were checked for correct deletion by PCR using primers 
D and E. A colony was selected carrying the correct deletion. At this point, the cells carried 
pKSUCl. The next step is to delete this plasmid. 
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[0069] This deletion is done through the replacement of pKSUCl with pBADaPy. A 

colony from the prior step was selected, grown in LB at 37°C under nonselective conditions, 
reinoculating the cells into fresh medium 2-3 times. Competent cells were prepared for either 
chemical transformation or electroporation. The plasmid pBADaPy (100-200 ng) was 
transformed into the competent cells which were plated on Amp plates. A colony which was Kan 
sensitive/Amp resistant was selected by toothpicking a hundred colonies on Kan and Amp plates. 
[0070] The selected colony can be used in a next round of deletion by using a new targeting 

fragment and repeating the steps above. If no more deletions are needed, growing the cells under 
nonselective conditions (no Amp is added) results in the spontaneous loss of pBADaPy from a 
large fraction of the cells. 

Procedure 2 

[0071] This procedure is similar to procedure 1, but pKSUCl is replaced by pSTKST. This 

plasmid is compatible with pBADaPy, has a temperature-sensitive replicon, and expression of I- 
Seel requires induction by chlortetracyclin (CTC). The advantage is that elimination of pSTKST 
from the cell is easily accomplished by growing the culture at 42°C. 

[0072] Construction of a linear targeting fragment by PCR and replacement of a genomic 

region by insertion of the fragment are done as described in Procedure 1 . 

[0073] To delete the inserted sequences competent cells are prepared from a culture derived 

from a selected colony harboring the right insertion. Cells are transformed by pSTKST , plated on 
Kan+Cam plates and incubated at 30°C. A colony from this plate is inoculated into 10 ml of 
LB+Kan supplemented with heat-treated inducer cTc (25 /xg/ml final concentration) and grown at 
30°C for 24 hours. This step serves induction of the expression of I-Scel. Dilutions of the culture 
are then spread on LB+Kan plates and incubated overnight at 30°C. 6-12 colonies were checked 
for correct deletion by PCR using primers D and E. A colony was selected carrying the correct 
deletion. 

[0074] To eliminate the helper plasmids from the cell, the culture is grown at 42°C in LB 

(no antibiotics added). 

Procedure 3 

[0075] Since pBADaPy and pSTKST carry compatible replicons, repeated transformations 

of the plasmids are not required when consecutive deletions are made in the same host. The two 
plasmids are maintained in the host cell throughout consecutive deletion constructions by 
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antibiotic selection (Kan+Amp). Recombinase and specific nuclease functions are induced only 
when needed. Since replication of pSTKST is temperature-sensitive, cells must be grown at 30°C. 
[0076] The procedure is identical to Procedure 2, except that pBADapy and pSTKST are 

transformed into the cell only once, and until maintenance of both plasmids in the cell is desired, 
the culture is grown at 30°C, and Amp+Kan are included in the medium. Note: Sometimes we 
experienced difficulties in growing the cells at 30°C in the presence of two (Amp+Kan) or three 
(Amp+Kan+Cam) antibiotics. 

Procedure 4 

[0077] This is the preferred procedure when several consecutive deletions are to be made in 

the same cell. Insertions (recombination of linear fragments into the genome of a host cell carrying 
pBADaPy) are made in parallel, creating a series of recombinant cells, each carrying a single 
insertion. These insertions are then transferred one by one by PI transduction into the cell carrying 
pSTKST and harboring all previous deletions. Removal of all foreign sequences is done in this 
final host by inducing pSTKST. Compared to the previous methods, the main difference is that 
the insertion step and removal of the inserted sequences are done in separate cells. Since insertions 
are made in parallel, the construction of consecutive deletions is faster. Another advantage is that 
cells are transformed by the plasmids only at the beginning of the first deletion construction. 
[0078] Technically the procedure is identical to Procedure 2, except that individual 

insertions are transferred by PI transduction to the deletion strain already harboring pSTKST. 
After each PI transduction step, I-Scel expression is induced to remove the inserted sequences. 

Results 

[0079] Forty-one consecutive genomic deletions have been made from E. coli strain K12 

MG1655. The twelve deleted regions were selected for deletion, in part, as a result of comparison 
of the genomic DNA sequences of E. coli strain 0157:H7 EDL933 and strain K-12 MG1655. The 
deletions are listed on Table 1 below. The sequence numbering is taken from the published K12 
sequence. 

[0080] The first deletion MD1 was made using the method described in Posfai, G. et aL, 

Nucl Acids Res. 27: 4409-4415 (1999). Using this method for creating MD1 deletion left a 1 14- 
bp pSG76-CS vector sequence, including a FRT site, in the chromosome at the site of deletion. 
MD2 through MD6 deletions were made using Procedure 1 described above. Deletions MD7 
through MD12 were created using a combination of Procedure 4 and Procedure 1 or 2. Strain 
designations and genomic coordinates of each new deletion were: MD1 263080-324632; MD2 
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1398351-1480278; MD3 2556711-2563500; MD4 2754180-278970; MD5 2064327-2078613; 
MD6 3451565-3467490; MD7 2464565-2474198; MD8 1625542-1650865; MD9 4494243- 
4547279; MD10 3108697-3134392; MD11 1196360-1222299; MD12 564278-585331. 
[0081] A total of 378,180 base pairs, which is approximately 8.1% of the native K12 

MG1655 E. coli genome, was removed at this stage. Removing these regions from the genome 
did not affect bacterial survival or bacterial growth. 

[0082] Table 2 below lists other segments, genes and regions of the E. coli genome that 

were identified as candidates for further deletions. The segments have also been successfully 
removed from the genome of the bacteria. Again, these deletions were made without any apparent 
deleterious effect on the usefulness of the bacteria for laboratory and industrial use. Again the 
sequence designations are taken from the published K12 sequence. The two sets of deletions 
totaled about 14% of the original bacterial genome. Further deletions are, of course, possible. 
[0083] Transformation competence. 

[0084] Making electrocompetent cells 

[0085] Electrocompetent cells were prepared as follows. Cells were streaked out from a 

glycerol stock onto an LB plate, grown overnight at 37°C. Next day (sometimes colonies from 
older plates were used) a single colony was inoculated into 10 ml LB in a 50-ml flask, incubated at 
200-220 rpm in a shaker at 37°C overnight. From this dense starter culture, 800 jlxI was inoculated 
into 100 ml LB in a 500 ml Erlenmeyer flask, incubated in an orbital shaker at 37°C at 200-220 
rpm. 

[0086] At OD=0.4 the flask was placed on ice for 20 min. From this point all manipulations 

were made at 0°C, tubes and solutions were chilled before use. 



[0087] 


1. Cells were centrifuged in 2x50 ml sterile tubes at 4000 rpm/10 min. 


[0088] 


2. Cells were suspended in 2x50 ml sterile water. 


[0089] 


3. Cells were centrifuged at 4000 rpm/10 min. 


[0090] 


4. Cells were suspended in 2x50 ml sterile water. 


[0091] 


5. Cells were centrifuged at 4000 rpm/10 min. 


[0092] 


6. Cells were suspended in 2x2 ml 10% glycerol. 


[0093] 


7. Cells were centrifuged at 4000 rpm/10 min. 


[0094] 


8. Cells were suspended in 2x100 pi 10% glycerol. 
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[0095] 9. The two tubes were combined, and cells were distributed into 40 jil aliquots in 

Eppendorf tubes (>5 tubes), frozen on dry ice and stored at -80°C until use. Total cell count/tube 
was between 10 9 and 10 10 . 

[0096J Electrocompetent cells were melted by keeping the tube in the hand, then placed on 

ice. DNA and 0.1 cm cuvettes were also kept on ice. 2 jil DNA (normally diluted to 10 ng/^il in 
TE) was added to the cells, mixed briefly, then pipetted in the cuvette. The cuvette was placed in 
an Invitrogen II device (settings: 1800 V, 50 |uF, 150 Q), and an electric pulse was applied. Cells 
were washed out from the cuvettes by pipetting 1 ml LB (for simplicity we used LB as recovery 
medium, in earlier experiments we used SOC medium) up and down in the cuvette. The mixture 
was transferred into a test tube and placed in a shaker for 1 h at 37°C. Appropriate dilutions were 
then spread on LB+antibiotics plates. 

[0097] Plasmid DNAs (both small and large) were prepared according to the Promega 

Wizard medium prep protocol. (Growth in LB+antibiotics o/n, standard alkali lysis protocol, 
Promega Wizard Midicolumns, DNA recovered in TE buffer). Plasmids pBR322, pUC19 and 
pBeloBAC (100-kb insert) were prepared from DH5a cells. The results of the transformation 
events is summarized as follows. 

DH10B MDS31 MDS39 

Transformants per microgram 

PBR322 2xl0 8 2.2xl0 8 2.7xl0 8 

Methylated BAC 2xl0 6 0.6xl0 6 1.2xl0 6 

UnmethylatedBAC 1.8xl0 6 4xl0 3 * 3xl0 6 

[0098] Electroporation of MDS41-R13 and MG1655 and DH10B 

Methods were as described in the previous paragraph. The DNA was pBR322 or a lOOkb BAC 
DNA. 





Electroporation efficiency, relative to MG1655 




MDS41-R13 


MG1655 


DHlOb 


pBR322 


0.98 


1.00 


0.73 


lOOkb BAC 


4.1* 


1.00 


0.99 



Absolute value 2x10 s transformants/microgram 
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[0099] Electroporation of MDS41-R13, MG1655 and DH10B. 

Methods were as described above. The DNA was pBR322 DNA and pBeloBAC DNA 





Electroporation efficiency transformants /microgram 




MDS41-R13 


MG1655 


DHlOb 


pBR322 


3.34xl0 9 


2.4xl0 9 


3.3xl0 9 


PBeloBAC 


1.9xl0 5 







[00100] Growth of E. coii MDS41-R13 and MG1655 in LB medium. 

50ml of sterile LB medium in two sterile 250ml non-baffled Erlenmeyer flasks with 
polypropylene caps were each inoculated with a loopfol of frozen glycerol stocks of MG1655 and 
MDS41-R13 respectively. They were grown overnight at 37°C in an orbital shaker at 250rpm 
(seed cultures). The OD650 of each culture was determined, after dilution, if needed, to stay 
within the linear part of the spectrophotometer range, using a microplate spectrophotometer. pH 
was determined using a standardized and calibrated pH meter and combination electrode. Fresh 
50ml cultures were inoculated with 500 microliters of the overnight cultures and grown in the 
same way as the overnight cultures. At intervals, samples were removed from the cultures and 
the OD650 and pH were measured as described above. The doubling time of the cultures during 
the exponential phase of growth was calculated from the slope of a graph of log OD650 versus 
time. 





MDS41-R13 


MG1655 


Stationary phase OD650, 
overnight seed culture 


1.92 


2.22 


pH, overnight seed culture 


8.05 


8.45 


Growth rate in exponential 
growth phase 


53 minutes 


51 minutes 


Lag phase, MDS41-R13 
versus MG1655 


72 minutes 


NA 



[00101] Electroporation of competent E. coii grown in LB medium 

Overnight seed cultures were prepared as described above. 

From this seed culture, 500 fil was inoculated into 50 ml LB in 250 ml baffled Erlenmeyer 
flasks, incubated in an orbital shaker at 37°C at 200-220 rpm and the OD650 of the culture was 
monitored as described above. For greater quantities of cultures, additional cultures were set up 
as described above and grown in parallel. When the OD650 reached the desired value, the flask 
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was placed on ice to reduce the temperature to 0-4°C. From this point all manipulations were 
made at 0-4°C. All tubes and solutions were chilled before use. 

1 . Cultures were centrifuged in sterile 250 ml centrifuge bottles in the GSA rotor of a Sorvall 
RC2B or RC5B centrifuge for 10 minutes at 5,000 rpm at 4°C to separate the E. coli cells as a 
pellet. 

2. Cells were suspended in 250 ml sterile MilliQ water. 

3. Cells were centrifuged at 5,000 rpm for 5 min 

4. Cells were suspended in 250 ml sterile MiliQ water. 

5. Ceils were centrifuged at 5,000 rpm for 5 min 

6. Cells were suspended in 250 ml 10% or 15% glycerol 

7. Cells were centrifuged at 5,000 rpm for 5 min 

8. Cells were suspended in a volume of 10% or 15% glycerol approximately equal to the volume 
of the pellet. The OD650 of the final cell suspension was determined after 200x dilution in the 
microplate spectrophotometer. 

9. The final cell suspensions were used fresh for electroporation, or were distributed into 40 fx\ 
aliquots in 1.5ml Eppendorf tubes, frozen on dry ice and stored at -80°C in a freezer until use. 

Frozen cell suspensions were melted by keeping the tube in the hand, then placed on ice. DNA 
and 0.1 cm cuvettes were also kept on ice. 1 DNA (normally diluted to 10 ng//xl in TE) was 
added to the cells, mixed briefly, then pipetted into the cuvette. The cuvette was placed in an 
Eppendorf device (settings: 1800 V), and an electric pulse was applied. Cells were washed out 
from the cuvettes by pipetting 1 ml recovery medium up and down in the cuvette. The mixture 
was transferred into a test tube and placed in a shaker for 1 h at 37°C. Appropriate dilutions were 
then spread onto agar plates containing LB and an appropriate antibiotic for selection of the 
transformants. The concentration of antibody was used that suppressed the growth of the 
untransformed cells but not of the transformed cells. 



-25- 



Specific conditions used: 

OD650 of culture at harvest = 0.60 
Transforming DNA = pUC19 lOpg/transformation 
Transformed fresh 
Recovery medium = LB 





Transformation frequency 
(transformants/microgram DNA) 




MDS41-R13 


MG1655 


PUC19 


9.3 x 10 8 


4.6 x i0 7 



[00102] Growth of E. coli MDS41-R13, MDS40 and MG1655 in LB medium 

The experiment was done as described above, with variations. 
Specific variations in conditions used were as follows: 



Seed culture - 250ml non-baffled Erlenmeyer flask 
Main culture - 250ml baffled Erlenmeyer flask 
Medium = LB 





MDS41-R13 


MDS40 


MG1655 


Stationary phase OD650 - 
seed culture 


0.96 


1.34 


2.00 


pH - seed culture 


8.43 


8.81 


8.90 


Doubling time in 
exponential phase - main 
culture 


33 min 


33 min 


33 min 


Lag phase MDS41-R13 
compared with MG1655 - 
main culture 


84 min 


95 min 




Stationary phase OD650 - 
main culture 


ND 


ND 


2.22 


Stationary phase pH - main 
culture 


8.43 


8.80 


8.90 
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[00103] Growth of E. coli MDS41-R13 and MG1655 in Terrific Broth 

The experiment was done as above, with variations. The specific variation in conditions used 
were as follows: 



Seed culture - 250ml non-baffled Erlenmeyer flask 
Main culture - 250ml baffled Erlenmeyer flask 
Medium = Terrific broth 





MDS41-R13 


MG1655 


Stationary phase OD650 - seed culture 


3.97 


3.30 


pH - seed culture 


5.63 


5.45 


Doubling time in exponential phase - main 
culture 


36 min 


36 min 


Lag phase MDS41-R13 compared with 
MG1655 - main culture 


0 min 




Stationary phase OD650 - main culture 


6.76 


6.28 


Stationary phase pH - main culture 


5.63 


5.45 



[00104] Electroporation of competent E. coli MDS41-R13 and MG1655 grown in Terrific 
Broth at high cell concentration and 37°C. 

Competent cells were prepared and electroporated as desribed above, with the following 
differences: 

Medium (seed and main cultures) = Terrific Broth 

Stationary phase OD650 of seed culture MDS41-R13 = 6.06 & MG1655 = 5.37 
Seed culture diluted 1:1000 into main culture medium 

Main cultures (duplicate) harvested at MDS41-R13 = OD 2.17 & MG1655 = 2.26 
Transforming DNA = pUC19 at lOpg/transformation 
Transformed fresh 

Concentration of glycerol in final cell suspension = 15% 
Recovery medium = Terrific Broth 





Mean transformation efficiency 
(transformants/microgram DNA) 




MDS41-R13 


MG1655 


pUC19 


4.10 x 10 9 


3.41 x 10 7 
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[00105] Electroporation of competent E. coli MDS41-R13 and MG1655 grown in Terrific 
Broth at high cell concentration and 37°C. 

Competent cells were prepared and electroporated as described above, with the following 
differences: 

Medium (seed and main cultures) = Terrific Broth 

Stationary phase OD650 of seed culture MDS41-R13 = 6.99 & MG1655 = 5.67 
Seed culture diluted 1:1000 into main culture medium 

Main cultures (duplicate) harvested at MDS41-R13 = OD 2.02 & MG1655 = 2.06 
Concentration of glycerol in final cell suspension = 15% 
Transforming DNA = pUC19 at lOpg/transformation 
Transformed fresh and after freezing at -80°C 
Recovery medium = Terrific broth 





Mean transformation efficiency 
(transformants/microgram DNA) 




MDS41-R13 


MG1655 


pUC19 transformed fresh 


1.1 xlO 10 


1.64 x 10 s 


pUC19 transformed after 
freezing 


2.6 x 10 9 


ND 



[00106] In the above description, the present invention is described in connection with 
specific examples. It will be understood that the present invention is not limited to these 
examples, but rather is to be construed to be of spirit and scope defined by the appended claims. 
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Description b 


b0246-b0310: includes K-islands #16. 17. 18. CP4-6. eaeH 


b1336-b141 1; includes K-island #83, Rac 


b2441-b2450; includes K-island #128. CP-Eut 


b2622-b2660; includes K-island #137. CP4-57. ileY 


b1994-b2008; includes K-islands # 94. 95, 96, CP4-44 


b3323-b3338; includes K-islands #164. 165 


b2349-b2363: includes K-island #121 


b1539-b1579; includes K-island #77, Qin 


b4271-b4320; includes K-island #225. fee ODeron. fim ooeron 


b2968-b2987; includes K-island #153, qlc ODeron 


b1137-b1172; includes K-island #71. e14 


b0538-b0565; includes K-island #37. DLP12 


Size (bp) 


61553 


81928 


6790 


35091 


14287 


15926 


9634 


25244 


53037 


25696 


25940 


21054 


to 

<Z> 

c 

'o 

Q 
"O 

c 
LU 


263080. 324632 


1398351. 1480278 


2556711.2563500 


2754180, 2789270 


2064327, 2078613 


3451565. 3467490 


2464565. 2474198 


1625542, 1650785 


4494243. 4547279 


3108697.3134392 


1196360. 1222299 


564278, 585331 


Deletion 


MD1 


MD2 


MD3 


MD4 


MD5 


MD6 


MD7 


MD8 


MD9 


MD10 


MD11 


MD12 



TABLE 2 

SECOND SET OF COMPLETED DELETIONS 

IS 186 deletions (3) 

keepdnaJ 14168,15298 (+) 

♦delete GP1 15388,20563 IS186, gef, nhaAR, IS1 

[IS186 15388,16730] 

[IS1 19796,20563] 
keeprpsT 20815,21078 (-) 



keep pheP 601 182,602558 (+) 

♦delete GP2 602639,608573 ybdG, nfeB, ubdF, ybdJ, ybdK, IS186 

{IS 186 607231,608573] 
keepentD 608682,609311 (-) 



keep glk 2506481,2507446 (-) 

♦delete GP3 2507650,2515969 b2389, b2390, b2391, b2392, nupC, IS186, yfeA 

[IS186 2512294,2513636] 
keepalaX 2516061,2516136 (-) 



IS2 deletions (3 not already deleted) 

Keep yaiN 378830,379126 (-) 

♦delete GP4 379293,387870 yaiO, b0359, IS2, b0362, yaiP, yaiS, tauABCD 

[IS2 380484,381814] 
keep hemB 387977,388984 (-) 

♦delete GP5 389121 ,399029 b0370, yaiT, IS3, yaiU, yaiV, ampH, sbmA, yaiw, yaiY, yaiZ 

[IS3 390933,392190] 
keep ddlA 399053,400147 (-) 



keep ygeK 2992482,2992928 (-) 

♦delete GP6 2992959,2996892 b2856, b2857, b2858, b2859, IS2, b2862, b2863 

[IS2 2994383,2995713] 
keep glyU 2997006,2997079 (-) 

keep ribB 3181 829,3 1 82482 (-) 

♦delete GP7 3182796,3189712 b3042, ygiL, IS2, yqiGHI (fimbral locus) 

[IS2 3184112,3185442] 
keepglgS 3189755,3189955 (-) 



IS5 deletions (6 not already deleted) 

keep ybeJ 686062,686970 (-) 



♦delete GP8 
keep lnt 
keep tpx 
♦delete GP9 

[IS5 
keep fnr 



687074,688268 IS5 
688566,690104 (-) 
1386329,1386835 (-) 

1386912,1396646 ycjG, ycjl, ycjY, ycgZ, mppA, ynal, IS5, ynaJ, ydaA 

1394068,1395262] 

1396798,1397550 (-) 



keep gnd 2097884,2099290 (-) 

♦delete GP 1 0 20994 1 8,2 135739 IS5 plus entire O Antigen and Colanic Acid clusters 

[IS5 2099771,2100965] 
keep yegH 2135858,2137507 (+) 



keepproL 2284231,2284307 (+) 

♦delete GP1 1 2284410,2288200 yejO and IS5 

[IS5 2286939,2288133] 
keepnarP 2288520,2289167 (+) 

-30- 



keepgltF 3358811,3359575 (+) 

♦delete GP12 3359747,3365277 IS5 plus yhcADEF (K-island) 

[IS5 3363191,3364385] 
keep yhcG 3365462,3366589 (+) 

keep arsC 3647867,3648292 (+) 

♦delete GP13 3648921,3651343 yhis and IS5 

(IS5 3640666,3650860] 
keep sip 3651558,3652157 (+) 



keep mviN 1 127062,1 128597 (+) 

♦delete GP14 1 128637,1 140209 flgAMN flgBCDEFGHIJKL 
keep rne 1 140405,1 143590 (-) 



keep yecT 1959975,1960484 (+) 

♦delete GP 1 5 1 960605, 1 977294 flh, che, mot, tap, tar, IS 1 
keep yecG 1977777,1978205 (+) 

Regions Ilia and Illb try deleting both in one action 
keepsdiA 1994133,1994855 (-) 

♦delete Gpl6 1995085,2021700 fli, plus amyA, yec and yed ORFs 
keep res A 202 1 990,20226 1 3 (+) 



hsd region 

keep uxuR 4552145,4552918 (+) 

♦delete GP17 4553059,4594581 yji ORFS, plus mcrBCD, hsdRMS, mrr, tsr 
keep mdoB 45947 1 9,459697 1 (-) 



Rhs elements 

keep ybbP 519640,522054 (+) 

♦delete GP 1 8 522062,529348 RhsD element & associated ORFs 
keep ybbB 529356,530450 (-) 

keep ybfA 728357,728563 (+) 

♦delete GP19 728616,738185 RhsC element & associated ORFs 
keep ybgA 738224,738733 (+) 

keep yncH 1524964,1525176 (+) 

♦delete GP20 1525914,1531648 RhsE element & associated ORFs 
keep nhoA 1532048,1532893 (+) 

keep nikR 3616219,361662 (+) 

♦delete GP21 3616623,3623309 RhsB element & associated ORFs # 

may need to leave something here to separate converging ORFs? 
keep yhhJ 3623310,3624437(0 

keep yibF 3758974,3759582 (-) 

♦delete GP22 3759620,3767868 RhsA element & associated ORFs 
keep yibH 3767870,3769006 (-) 



flagella 
Region I 



Region II 
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the rest of the IS elements 

keep appA 1039840,1041 138 (+) 

*delete GP23 1041253,1049768 yccZYC (EPS), ymcDCBA (EPS?), ISl 

[ISl 1049001,1049768] 
keepcspH 1050186,1050398 (-) 

keep phoH 1084215,1085279 (+) 

♦delete GP24 1085329, 1096603 ycdSRQPT (hms homologues), IS3, ymdE, ycdU 

[IS3 1093468,1094725] 
keepserX 1096788,1096875 (-) 

keep baeR 2162298,216302 (+) 

♦delete GP25 2163172,2175230 P2 remnant, IS3, gat operon 

[IS3 2168193,2169450] 
keep fbaB 2175532,2176656 (-) 

keep yhhX 3577399,3578436 (-) 

♦delete GP26 3578769,3582674 yhhYZ, ISl, yrhAB 

[ISl 3581059,3581826] 
keep ggt 3582712,3584454 (-) 

keep cspA 3717678,3717890 (+) 
♦delete GP27 3718262,3719704 IS 150 
[IS 150 3718262,3719704] 

keepglyS 3719957,3722026 (-) 
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